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Abstract 

Background: In order to obtain a lead of the pathophysiology of endometriosis, genonne-wide expressional 
analyses of eutopic and ectopic endonnetriunn have earlier been reported, however, the effects of stages of severity 
and phases of nnenstrual cycle on expressional profiles have not been examined. The effect of genetic 
heterogeneity and fertility history on transcriptional activity was also not considered. In the present study, a 
genome-wide expression analysis of autologous, paired eutopic and ectopic endometrial samples obtained from 
fertile women (n = 18) suffering from moderate (stage 3; n = 8) or severe (stage 4; n = 10) ovarian endometriosis 
during proliferative (n = 13) and secretory (n = 5) phases of menstrual cycle was performed. 

Methods: Individual pure RNA samples were subjected to Agilent's Whole Human Genome 44K microarray 
experiments. Microarray data were validated (P<0.01) by estimating transcript copy numbers by performing real 
time RT-PCR of seven (7) arbitrarily selected genes in all samples. The data obtained were subjected to differential 
expression (DE) and differential co-expression (DG) analyses followed by networks and enrichment analysis, and 
gene set enrichment analysis (GSEA). The reproducibility of prediction based on GSEA implementation of DG results 
was assessed by examining the relative expressions of twenty eight (28) selected genes in RNA samples obtained 
from fresh pool of eutopic and ectopic samples from confirmed ovarian endometriosis patients with stages 3 and 4 
(n = 4/each) during proliferative and secretory (n = 4/each) phases. 

Results: Higher clustering effect of pairing (cluster distance, cd = 0.1) in samples from same individuals on 
expressional arrays among eutopic and ectopic samples was observed as compared to that of clinical stages of 
severity (cd = 0.5) and phases of menstrual cycle (cd = 0.6). Post hoc analysis revealed anomaly in the expressional 
profiles of several genes associated with immunological, neuracrine and endocrine functions and gynecological 
cancers however with no overt oncogenic potential in endometriotic tissue. Dys-regulation of three (GLOGK, ESRl, 
and MYG) major transcription factors appeared to be significant causative factors in the pathogenesis of ovarian 
endometriosis. A novel cohort of twenty-eight (28) genes representing potential marker for ovarian endometriosis 
in fertile women was discovered. 

Conclusions: Dysfunctional expression of immuno-neuro-endocrine behaviour in endometrium appeared critical to 
endometriosis. Although no overt oncogenic potential was evident, several genes associated with gynecological 
cancers were observed to be high in the expressional profiles in endometriotic tissue. 
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Background 

Endometriosis is a complex disorder involving pathogen- 
esis and clinical presentation of ectopically implanted 
endometrium [1]. It is generally assumed that elucidation 
of molecular expressional specificities of eutopic and ec- 
topic endometrium may provide leads towards a better 
understanding of the pathophysiology of the disorder [2] . 
To this end, several studies exploring the differential ex- 
pression of genes between autologous eutopic and ec- 
topic endometrium from patients with endometriosis 
have been reported, however, with no specific compari- 
son for stages of severity, fertility history and phases of 
menstrual cycle [3-7], except a recent report [8]. More 
over, it is notable that two types of endometriosis, namely 
ovarian endometriosis and peritoneal endometriosis re- 
portedly show differential characteristics [4,9]. Further- 
more, there is evidence to support the idea that deep 
infiltrating endometriosis also show differential patho- 
physiology as compared to ovarian and peritoneal endo- 
metriosis [10,11]. In the present study, we examined a 
genome-wide large-scale transcript survey of autologous, 
paired eutopic and ectopic endometrial samples obtained 
from fertile women suffering from moderate to severe 
ovarian endometriosis, and excluded cases of peritoneal 
endometriosis and deep infiltrating endometriosis. We 
assumed that the present model of subject selection 
would reduce the impact of biological noise derived from 
genetic and pathogenetic heterogeneity and subfertility- 
associated variability on the transcriptional activity in the 
target tissue. We report here for the first time that clus- 
tering effect of expressional arrays among eutopic and 
ectopic samples was higher for genetic homogeneity 
(i.e. pairing of eutopic and ectopic samples from same 
individuals) than that of clinical stages of severity and 
phases of menstrual cycle. Based on the present 
transcriptomics data, we have also hypothesized that dys- 
functional immuno-neuro-endocrine behaviour in endo- 
metrium was associated with the pathogenesis of 
endometriosis. Additionally, we did not observe an overt 
oncogenic potential in the expressional profiles in endo- 
metriotic tissue, however, several genes associated with 
gynecological cancers were highly expressed in the euto- 
pic and ectopic endometrium. Finally, a novel cohort of 
28 genes was identified, the expression of which carry po- 
tential marker value for endometriosis in fertile women. 
A flow diagram of the experimental design is shown in 
Figure 1. 

Methods 

Subjects and tissue samples 

The present study was approved by the Ethics 
Committee on the Use of Human Subjects, AU India 
Institute of Medical Sciences (AIIMS), New Delhi. The 
patients enroUed in the Department of Obstetrics and 



Gynecology - AIIMS and showing evidence of endome- 
triotic lesions, adhesions and endometriotic cyst were 
selected to participate in the present study. AU the 
patients were reportedly fertile and referred from the 
Pain Clinics, and had voluntarily agreed to donate their 
samples after understanding the purpose of the proposed 
study. Signed informed consent was obtained from each 
participant of this study. As shown in Figure 1, twenty- 
six (26) normally cycling and proven fertfle women (age: 
24-45 y) with history of pregnancy and with at least one 
living biological offspring, and body mass indices within 
normal ranges (20-22 k/m^) having ovarian endometri- 
osis were selected for the present study. Confirmation of 
ovarian endometriosis and exclusion of other types of 
endometriosis was achieved from reports of pelvic im- 
aging based on ultrasound, MRI and/or diagnostic lapar- 
oscopy as described elsewhere [8]. Severity stages 3 and 
4 of the disease condition were defined at the time of 
surgical laparoscopy [8] according to rASRM protocol 
[12]. Selected subjects (n = 18; shown as in Additional 
file 1: Table SI) contributed their eutopic (shown as A' 
in Figure 2) and ectopic (shown as 'B' in Figure 2) sam- 
ples during proliferative (days 9-14) phase (n = 17) and 
secretory (days 17-24) phase (n = 8) of menstrual cycle 
as described elsewhere [8]. Additional paired samples 
coUected from different group of subjects (n = 8; shown 
as 'Ep' in Additional file 1: Table SI) with confirmed 
ovarian endometriosis as described above and with clas- 
sified menstrual (proliferative: n = 4; secretory: n = 4) 
phases and severity stages 3 (n = 4) and 4 (n = 4) were 
employed for validating the prediction as described 
below. A smaU piece from each specimen was processed 
for chemical fixation in neutral buffered formaldehyde 
(4%, w/v) for subsequent confirmation of phase of cycle, 
state of pathology and ceU types from eutopic and ec- 
topic samples, and the residual portions were trans- 
ported on ice to the laboratory within 10 minutes of 
collection for further processing for RNA extraction. 

Experimental procedure 

The methodological details of RNA extraction foUowed 
by the estimation of its yield and purity using standard 
electrophoretic and spectrophometric protocols and its 
RIN score using the Agilent 2100 Bioanalyzer, RNA 
6000 Nano LabChip kit and Agilent 2100 Expert Soft- 
ware (Agilent Technologies, Santa Clara, CA, USA) have 
been given elsewhere [8,13]. Individual RNA samples 
from eutopic and ectopic tissue samples (n = 18) from 
confirmed stages 3 (n = 8) and 4 (n = 10) coUected during 
proliferative (n = 13) and secretory (n = 5) phases and 
having RIN scores >8.0 were subjected to whole tran- 
scriptome array experiment using the Agilent Whole 
Human Genome 60-mer 4X44K microarray according to 
the manufacturers recommendations. Thus, seven (7) 
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Figure 1 Flow diagram of the experimental design showing overall aim and work plan of the present study. 



samples could not be used either for insufficient RNA 
yield or RIN scores (see Additional file 1: Table SI for 
the subject details of the selected samples). Hybridized 
arrays were scanned with Agilent's G2505B microarray 
scanner system and the raw data were imported into 
GeneSpring 11.5.0 software (Agilent Technologies, Santa 



Clara, CA, USA) for further analysis. Pearsons correl- 
ation coefficients done to assess the reliability of data 
obtained from two separate hybridization runs for same 
RNA preparation for four (4) eutopic and ectopic sam- 
ples confirmed the reproducibility assurance (P<0.01) 
among hybridizations. Analysis of the data retrieved 
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Figure 2 General descriptive characteristics of expressions in eutopic and ectopic endometrium. (A) Histogram of frequency distribution 
of probes (genes) for different groups of expression levels (in log2) in eighteen (18) autologous, paired eutopic {blank bar) and ectopic 
{hatched bar) endometrial samples. (B) Two-way representation of unsupervised hierarchical cluster analysis (HCA) of the expression levels (in 
logarithmic scale) of all the target probes/genes (Y-axis) in each sample (each column), eutopic labelled as A and ectopic labelled as B from 
all subjects (n = 18) and their clustering based on expressional distance (Pearson correlation coefficient) between samples in dendrogram 
formation (X-axis). Each horizontal line represents a single probe, and each column represents a single sample. Relative expression of each 
probe is colour-coded: high {red) and low {blue), as indicated in the colour legend. Categorical annotations of each sample are shown in the 
X-axis. The samples cluster by cycle phase and severity stages, as shown by the bar at the bottom of the heat map: proliferative phase {black 
bar), secretory phase {crossed white bar), stage 3 {red crossed bar) and stage 4 {blank red bar). A, eutopic; B, ectopic; P, proliferative phase; S, 
secretory phase; 3, stage 3; 4, stage 4. 



from separate chips with the same RNA samples yielded 
QC statistics highly concordant with that of the manu- 
facturer, and it revealed more than 95% confidence level 

Data analysis 

Unsupervised and supervised hierarchical clustering ana- 
lysis (HCA), and non-hierarchical K-mean cluster 



analysis of expression arrays were performed with the 
help of GeneSpring software 11.5.0. Analysis of variance 
followed by pair- wise differential (>3-fold at P < 0.01) ex- 
pression (DE) for specific genes between eutopic and ec- 
topic samples, as well as, between proliferative and 
secretory phases, and between clinical stage 3 and stage 
4 of severity for eutopic endometrium, and for ectopic 
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endometrium, respectively were done using multiple 
comparison tests as described elsewhere [14]. 

Post-hoc analysis 

Networks and enrichment analysis were done using gene 
lists obtained from the above analyses and based on a 
priori setting of a cut-off threshold (pFDR(p) = 0.05) with 
the help of the GeneSpringl 1.5.0 software and Metacore 
platform (GeneGo, St. Joseph, MI, USA). The K-mean 
clusters were further used for differential co-expression 
(DC) analyses and analyzed in terms of Gene Ontology 
(GO) enriched categories using GeneSpringl 1.5.0 soft- 
ware. Gene Set Enrichment Analysis (GSEA) version 3.7 
was applied to each of the K-mean clusters independently 
to examine at FDR < 0.25 for not less than 10 genes for a 
set with a maximum of 1000 permutation whether pre- 
annotated BROAD gene sets [15]: CI (cytogenetic sets), 
C2 (functional sets), C3 (regulatory sets), C4 (cancer 
neighborhood sets), and C5 (gene ontology sets) could 
identify any interesting information in the DC sets [16]. 

Quantification of candidate gene expression by real time 
RT-PCR 

In order to validate the microarray data, relative expres- 
sion of arbitrarily chosen seven (7) selected genes {ATX, 
DDHDl, DYNLTl, FTHl, LAMRl, MIER2, and WDR87) 
in eutopic and ectopic samples collected from all 
patients were performed using Taqman multiplexing 
technology on iCycler iQTm real time RT-PCR detection 
system (BioRad, Hercules, CA, USA). GAPDH was 
selected as an endogenous control based on its observed 
expressional consistency in arrays on data analysis. Pri- 
mers and probes were designed on Beacon Designer 
software7 (Labware Scientific Inc., Milipitas, CA, USA) 
and obtained from Qiagen (Cologne, Germany) (see 
Additional file 2: Table S2 for the details). The ratio of 
estimated efficiency of the primers for the selected genes 
and GAPDH was -1.0. An optimized kit (QuantiTect 
multiplex PCR kit, Qiagen, Cologne, Germany) was used 
to synthesize cDNA from respective RNA (5 [ig) sam- 
ples. Relative expression ratios between groups were cal- 
culated by using 2'^^^^ method [17]. Quantification of 
copy numbers for target transcripts in complex RNA 
samples was obtained as described elsewhere [18]. Com- 
parison between fold change data obtained from real 
time RT-PCR and microarray image analysis for selected 
seven (7) genes revealed a high degree of concordance 
and pattern similarity in expression profile. Concordance 
correlation test between real time RT-PCR based quanti- 
tative data and microarray data for the seven (7) genes 
showed a high degree of correlation (P < 0.01) [8]. 

In order to test the reproducibility of prediction 
derived from analysis of microarray results, the relative 
expressions of twenty eight (28) selected genes in 



individual RNA samples obtained from eight (8) add- 
itional subjects giving paired eutopic and ectopic samples 
with confirmed endometriosis stages 3 (n = 4) and 4 
(n = 4) during proliferative (n = 4) and secretory (n = 4) 
were examined using real time RT-PCR technology. The 
subject details are shown in Additional file 1: Table SI. 
The genes selected based on GSEA implementation of 
DC results were employed to test the predictability func- 
tion of the expression of those genes. The details of RNA 
methodologies are given above. GAPDH was selected as 
an endogenous control based on its observed expres- 
sional consistency in arrays on data analysis. All primers 
were designed on the Beacon Designer software7.0 (Lab- 
ware Scientific Inc., Milipitas, CA, USA) based on SYBR 
green chemistry and obtained from Qiagen (Cologne, 
Germany). QuantiTect Reverse Transcription kit for 
cDNA synthesis and QuantiPast SYBR green PCR kit for 
PCR amplification from Qiagen (Cologne, Germany) 
were used according to the protocol given by the manu- 
facturer. The estimates of relative expression ratios be- 
tween groups and copy numbers for target transcripts in 
complex RNA samples were obtained as described above. 

Results 

The data sets are available at NCBI-GEO website [19]. 

A distribution histogram of the number of probes and 
genes for different ranges of expression in autologous, 
paired eutopic and ectopic samples obtained from eighteen 
(18) fertile women with confirmed ovarian endometriosis 
is shown in Figure 2A. Total numbers and per cent esti- 
mates of probes/genes expressed in eutopic and ectopic 
samples in optimized scale are shown in Table 1. On aver- 
age, -75% and -50% of expressed genes showed marked 
signal in eutopic and ectopic samples, respectively. 

Table 1 Descriptive analysis of array data 

Parameter Estimate 



Per chip Per cent 



Number of probes 41 000 




(genes) (29421) 




Number of hybridized probes (genes)^ 




Eutopic 35646 


87 


(25987) 


(88) 


Ectopic 35587 


87 


(26222) 


(89) 


Number of high expressed probes (genes)'' 




Eutopic 23267 


65 


(19168) 


(74) 


Ectopic 15912 


45 


(13681) 


(52) 



^Hybridization signal more tlian mean optimized background signal ±2SD. 
'^>0 in normalized log2 scale. 
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Unsupervised HCA yielded marked segregation of samples 
into two major clustering branches with clustering cohe- 
sion being highest (cluster distance, cd: 0.1) between paired 
samples from same subjects. However, clustering cohesion 



was only moderate in samples which were classified based 
on either severity stages (cd: 0.5) or phases of menstrual 
cycle (cd: 0.6) (Fig. 2B). Supervised HCA revealed that the 
ectopic location of tissue had a higher clustering effect (cd: 
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Figure 3 Venn analysis of distribution of differentially expressed (DE) genes in eutopic-to-ectopic analysis. Distribution of DE genes in 
(A) eutopic and ectopic samples of stage 3 and stage 4, and proliferative and secretory phases, (B) stage 3-to-stage 4 for eutopic and ectopic 
samples, and (C) proliferative-to-secretory phases for eutopic and ectopic samples. Common genes among comparative groups are detailed in 
respective tables along with the vector of regulation and fold changes. The number of genes with relative up-regulation and down-regulation 
are shown by respective arrows. For details of DE genes, see Additional file 3: Table S3. Note that the areas in the Venn distribution analysis are 
not drawn to scale. 
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0.2) than that of phases of cycle (cd: 0.3), but not than that 
of the clinical stages of severity (cd: 0.1). 

Differential expression (DE) 

Additional file 3: Table S3 gives the list of the genes 
along with their differential expression (DE) patterns 
under different categories based on expressional arrays 
in autologous, paired eutopic and ectopic samples 
obtained from 18 fertile women with ovarian endometri- 
osis. Figure 3 shows the number of genes with DE in dif- 
ferent categories of comparison and the lists of common 
genes in it. Table 2 highlights the enriched categories of 
pathways for the common genes from above-mentioned 
DE analysis between eutopic and ectopic endometrium. 
It appeared that different signaling pathways associated 
with immune response, several neuronal processes, and 
ERBB family signaling pathways were commonly 
selected. A summary of DE analysis of the non-common 
genes showing differential display under different cat- 
egories and their enrichment analysis are shown in 
Table 3. Collectively, it appeared that informational flow 
for a wide array of pathways involving cellular signaling, 
apoptosis and survival, cytoskeleton remodeling, chemo- 
taxis, cell adhesion, immune response and several neuro- 
physiological processes were affected. 

K-mean clusters and differential co-expression (DC) 

As shown in Figure 4, K-mean cluster analysis identified 
four clusters of expression patterns and profiles based 
on normalized hybridization signals for all expressed 
genes in aU samples. The genes in cluster 1 (Kl) did not 
show any specific expression pattern, while other three 



clusters showed overt patterns for menstrual cycle 
phases and severity stages. A large number of genes 
belonging to cluster 2 (K2) showed over-expression in 
severity stage 4 secretory phase endometrium (Fig. 4B). 
The co-expressed genes in cluster 3 (K3) and cluster 4 
(K4) showed very similar patterns with an overall higher 
expression in stage 3 as compared to stage 4 endomet- 
rium samples irrespective of cycle phases. 

Table 4 shows the pathways -based enrichment analysis 
of groups of genes in four (4) K-mean clusters revealing 
differential co-expression (DC) profiles between paired 
eutopic and ectopic endometrial tissues. It essentially 
substantiated the observation obtained from DE analysis 
that transcriptomic signals related to cell cycle, signal 
transduction, cytoskeleton remodeling, apoptosis and 
survival, chemotaxis, cell adhesion, and immune re- 
sponse were affected in the pathogenesis process of 
endometriosis. 

Gene-set enrichment analysis (GSEA) 

Table 5 provides a summary of the results of GSEA im- 
plementation on co-expressed genes with differential 
display (DC) in the four K-mean clusters. In K4, one (1) 
cytoband i.e. CI set and two (2) gene ontology i.e. C5 
sets were selected. More over, three (3) DC gene sets - 
one each in Kl, K2 and K4, respectively - were selected 
under BROAD regulatory gene motif sets, C3. It is not- 
able that two (2) selected regulatory motif sets belonging 
to Kl and K2 were significantly (p< 0.0001) associated 
with ectopic sample as evident from their negative nor- 
malized enrichment scores (NES). Further, four (4) DC 
gene sets - two (2) each in Kl and K4, respectively - 



Table 2 Enriched common genes showing differential changes under different categories of comparisons^ 


Description of comparison 
(Number of genes) 


Gene in enriched 
category (Gene symbol) 


Enriched pathways 


(p-value) 


Eutopic-to-ectopic 


ERBB3 


Activation of astroglia proliferation 


(0) 


Stage 3 & Stage 4 (3)^ 




CDK5 mediated cell death and survival 


(0) 


Proliferative & Secretory (4)^ 




ERBB family signaling 

Membrane bound ESRl interaction with 
growth factor signaling 

Ligand-independent activation of ESRl and ESR2 


(0) 

(<0.01) 
(<0.01) 




ERBB3, LAMC2 


Alpha6/beta-4 integrins in carcinoma progression 


(<0.01) 


Stages 3-to-4 


STAT2 


Immune response involving IL-15 and IFN signaling 


(<0.02) 


Eutopic & Ectopic (50)"^ 








Proliferative-to-Secretory 




Angiotensin signaling via STATs 


(<0.03) 


Eutopic & Ectopic (4)"^ 


NOSIAP 


nNOS signaling in neuronal process 


(<0.03) 




APIBI 


Immune response involving regulation of 
T cell function by CTLA-4 


(<0.04) 




S0CS5 


Immune response involving IL-4 signaling 


(<0.04) 




G0T2 


GABA biosynthesis and metabolism 


(<0.05) 



^see Figure 2, ""A, and ""C. 
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Table 3 Estimates and enriched categories of differentially^ regulated non-common genes 



Specific analysis 



Nature of differential 
change [Number 
of genes] 



Top enriched pathways 



[Gene symbol(s) of 
major candidate(s)] 



(p-value) 



Pooled 



Stage 3 



Stage 4 



Up-reguloted [50] WNT signaling 

DNA damage-induced responses 
and apoptosis 

Role of 14-3-3 proteins in cell 
cycle regulation 

Cadherins mediated cell adhesion 

Endothelial cell contacts by 
non-junctional mechanisms 

Role of SCF complex in cell cycle 
regulation 

ATM/ATR regulation of cell cycle 

nNOS signaling in neuronal synapses 

Activation of astroglial cell 
proliferation by ACM3 

G-protein signaling in RhoA 
regulation pathway 

CDK5 in apoptosis and survival 

ERBB-family signaling 

Regulation of EIF2 activity 
associated with translation 

Ligand-independent activation of 
ESRl and ESR2 

Non-genomic action of 
androgen receptor 

Down-regulated [41] Regulation of glucose and 

lipid metabolism 

GDNF signaling 

Immune response involving antigen 
presentation by MHC class I 

Chemotaxis involving CCR4-induced 
leukocyte adhesion 

Up-regulated [4] No specific enriched category identified 

Down-regulated [48] Cytoskeleton remodeling involving 

RalB and RalA regulation pathway 

Clathrin coated vesicle formation 

Transcriptional silencing involving 
HPl family 

G-protein signaling involving interaction 
among Ras-family GTPases and 
K-RAS/N-RAS/H_RAS regulation pathway 

Up-regulated [31] Cell contraction involving relaxin 

and GPCRs 

Development involving 
endothelin-l/EDNRA signaling 

DNA damage induced apoptosis 
and DNA repair 

Beta-2 adrenergic dependent 
CFTR expression 

Regulation of lipid metabolism 

Alpha-1 adrenergic receptor signaling 



[NRCAM, WNT16] 
[CHEKl] 

[CHEKl] 

[CHP] 
[CHP] 

[CHEKl] 

[CHEKl] 
[RASDl] 
[ERBB3] 

[ARHGAP26] 

[ERBB3] 
[ERBB3] 
[CSNKIGI] 

[ERBB3] 

[WNT16] 

[APOE] 

[ITGBl] 
[HLA-C] 

[ITGBl] 

[RALGDS] 

[MYOID] 
[PFDN5] 

[RALGDS] 

[ADCY6, EDNRA, RXFPl] 

[ADCY6, EDNRA] 

[NBN] 

[ADCY6] 

[PPARA] 
[ADCY6] 



(0) 

(<0.01) 

(<0.02) 

(<0.03) 
(<0.03) 

(<0.03) 

(<0.04) 
(<0.03) 
(<0.04) 

(<0.04) 

(<0.04) 
(<0.05) 
(<0.05) 

(<0.05) 

(<0.05) 

(0) 

(<0.04) 
(0.05) 

(<0.05) 

(<0.01) 

(<0.02) 
(<0.02) 

(<0.03) 



(0) 

(<0.01) 

(<0.01) 

(<0.02) 
(<0.02) 
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Table 3 Estimates and enriched categories of differentially^ regulated non-common genes (Continued) 



Proliferative phase 



Down-reguloted [3] 



Up-regulated [109] 



Down-reguloted [20] 



Secretory phase 



Up-regulated [17] 



Mu- and kappa-type opioid receptor 
mediated pliysiological process 

Mucin expression via IL-6, IL-17 
signaling patliways 

G-protein signaling 

Transport from Golgi and ER to the 
apical membrane 

Intracellular cholesterol and 
sphingolipids transport 

RAS regulation pathway 

TC21 regulation pathway 

Regulation of CDC42 activity 

Sin3 and NuRD mediated 
transcription regulation 

GDNF family signaling 

Phospholipid metabolism 

Immune response involving 
CD40 signaling 

Cytoskeleton remodeling involving a-lA 
adrenergic receptor 

Dependent inhibition of PI3K and 
regulation of actin by Rho GTPases 

Cell contraction involving 6-type 
opioid receptor, S1P2 receptor, ACM 

Development associated MAG dependent 
inhibition of neurite outgrowth 

Development associated with TGF-beta 
dependent induction of EMT via 
RhoA, PI3K and ILK 

Cell adhesion involving histamine 
HI receptor 

Cell adhesion and chemotaxis 
involving integrin 

Chemotaxis involving inhibitory 
action of lipoxins on 
IL-8 and leukotriene B4-induced 
neutrophil migration 

GPCRs in platelet aggregation 

Immune response involving CCR3 
signaling in eosinophils 

Oxidative phosphorylation 

Transport involving RAN 
regulation pathway 

Immune response involving 
MIF-JABl signaling 

nNOS signaling in neuronal synapses 
and circadian rhythm 

Cell cycle associated spindle assembly 
and chromosome separation 

Regulation of lipid metabolism 

Regulation of glycogen metabolism 

Progesterone mediated maturation 



[ADCY6] 

FRAF3IP2] 

[ADCY6] 
[PPIA] 

[PPIA] 

[BCR, RASGRFl] 
[BCR, RASGRFl] 
[BCR, FGFRl] 
[CHD3, SIN3A] 

[GFRA2, NRTN] 
[GPD2, NRTN] 
[IRF1,TRAF3IP2] 



[LAMBl, MYL12B] 
[MYL12B] 
[MYL12B] 
FPMl] 

[MYL12B] 
[LAMBl, MYL12B] 
[MYL12B] 

[MYL12B] 
[MYL12B] 

[UQCRll] 
FNPOl] 

[PGR] 

[RASDl] 

FNPOl] 

FNPOl] 

[AGL] 

[PGR] 



(<0.03) 

(<0.04) 

(<0.04) 
(0) 

(<0.01) 

(0) 
(0) 

(<0.01) 
(<0.01) 

(<0.01) 
(<0.02) 
(<0.02) 



(<0.01) 
(<0.01) 
(<0.01) 
(<0.01) 

(<0.01) 
(<0.01) 
(<0.01) 

(<0.02) 
(<0.02) 

(<0.03) 
(<0.01) 

(<0.01) 

(<0.02) 

(<0.02) 

(<0.02) 
(<0.02) 
(<0.02) 
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Table 3 Estimates and enriched categories of differentially^ regulated non-common genes (Continued) 



Down-regulated [122] 



Eutopic 



Up-reguloted [182] 



Cell adhesion associated 
ECM remodeling 

TGF-beta receptor signaling 
in development 

Cell contraction involving 6-type 
opioid receptor 

Development associated 
Slit-Robo signaling 

Insulin mediated regulation 
of translation 

Leukotriene 4 biosynthesis 
and metabolism 

Chemotaxis involving inhibitory 
action of lipoxins on IL-8 and 
leukotriene B4-induced 
neutrophil migration 

Endoplasmic reticulum stress 
response pathway 

Immune response involving CCR3 
signaling in eosinophils 

GTP-XTP metabolism 

Cytoskeleton remodeling via RalB 
regulation pathway 

Cytoskeleton remodeling involving 
ACM3 and ACM4 

G-protein signaling involving regulation 
of cAMP levels by ACM 

Transcription involving Tubby signaling 
and HPl family 

Regulation of lipid metabolism involving 
G-alpha(q) regulation 

Cell contacts by non-junctional mechanisms 

NMDA -dependent neurophysiological 
process 

Cell cycle at metaphase check point 

G-protein signaling involving RaplA 
regulation pathways 

Regulation of translation through 
EIF4F activity 

Regulation of translation by alpha-1 
adrenergic receptors 

Development involving 
endothelin-l/EDNRA signaling 

Cytoskeleton remodeling via 
FAK signalin 

Immune response involving PGE2 
common pathways 

Immune response involving IL-17 
signaling pathways 

Cell contraction via oxytocin signaling 

Transcription via PPAR pathway 

Regulation of lipid metabolism through 
alpha-1 adrenergic receptors signaling 
via arachidonic acid 



[MME] (<0.03) 

FNPOl] (<0.03) 

[MYL9] (0) 

[DPYSL2, R0B03] (<0) 

[EIF4EBP1, PPPICC] (<0.01) 

[GGT5, LTA4H] (<0.01) 

[MYL9, RAC2) (<0.01) 

[ATF4, PPPICC] (<0.01) 

[MYL9, RAC2] (<0.03) 

[GUKl, NME3, P0LR3H] (<0.03) 

[RALGDS] (<0.04) 

[CHRM4, GNAQ] (0) 

[CHRM4, GNAQ] (0) 

[GNAQ] (0) 

[GNAQ, PTGS2] (0) 

[ITGA5, MAGI, PECAMl] (<0.01) 

[GNAQ, GRIN2A] (<0.01) 

[CBX3, INCENP] (<0.01) 

[MAGIl, RAPGEFl] (<0.02) 

[EIF4A2] (<0.03) 

[EIF4A2, GNAQ] (<0.03) 

[GNAQ, NPPB] (<0.03) 

[GNAQ, RAPGEFl] (<0.04) 

[GNAQ, PTGS2] (<0.03) 

[CXCL3, PTGS2] (<0.04) 

[GNAQ, PTGS2] (<0.04) 

[MEDl, PTGS2] (<0.04) 

[GNAQ, PTGS2] (<0.05) 
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Table 3 Estimates and enriched categories of differentially^ regulated non-common genes (Continued) 



Down-reguloted [48] 



Ectopic 



Up-regulated [665] 



Cell cycle regulation involving 
SCF complex 

p53 regulation involving SUMO 

Regulation of degradation and 
traffic of CFTR 

WNT signaling pathway involving 
degradation of bete-catenin 

Transcriptional silencing involving HPl family 

Immune response involving IL-12 and 
MIF-JABl signaling pathways 

Angiotensin signaling via beta-arrestin 

ATM/ATR regulation of Gl/S and 
G2/M checkpoints 

NGF signaling for apoptosis and survival 

and activation of NF-kB 

Neurophysiological process involving 
GABA-A receptor life cycle 

Regulation of translation initiation 



Transition and termination of 
DNA replication 

Activin A signaling regulation 

Beta-2 adrenergic-dependent 
CFTR expression 

Mu-type opioid receptor mediated 
neurophysiological process 

Development involving alpha-1 and 
beta-adrenergic 

receptors signaling via cAMP and 
PIPS signaling 

Cell adhesion involving ephrin signaling 



Transport involving RAB3 
regulation pathway 

Neurophysiological process involving 
corticoliberin signaling via CRHRl 

Signal transduction involving cAMP 
and PKA signaling 

G-protein signaling involving 
G-Protein beta/gamma 
signaling cascades 

G-protein signaling involving RhoA 
regulation pathway 

eNOS activity in cell contraction 

MAG-dependent inhibition of 
neurite outgrowth 

Neurophysiological process involving 
delta-type opioid receptor 

Neurophysiological process involving 
HTRIA receptor signaling 

Neurophysiological process involving 
melatonin signaling 



[CDC34, NEDD8, UBA52] (0) 

[UBA52] (<0.02) 
[DYNLLl, UBA52] (<0.02) 



[UBA52] (<0.03) 

[PFDN5] (<0.03) 
[UBA52] (<0.03) 

[CLTA, UBA52] (<0.03) 
[UBA52] (<0.04) 

[EPB41L1] 

[UBA52] (<0.04) 
[CLTA] (<0.04) 

[EIFl, RPL7, RPL12, RPL15, (<0.04) 
RPL21, RPL22, RPL29, RPS3A, 
RPSIO, RPS14, UBA52] 

[UBA52] (<0.04) 

[UBA52] (<0.05) 

[ADCY2, ADRB3, CREBl, (0) 
PRKARIB, PRKAR2B] 

[ADCY2,ADCY5, CREBl, (0) 
HPCA, PRKARIB, PRKAR2B] 

[ADCY2, ADCY5, AHT3, (0) 
CREBl, F0X03, GABl, 
PRKARIB, PRKAR2B, YWHAE] 

[ADAM 10, EFNA5, (0) 
EPHA4, EPHB6] 

[DMXL2, RAB3B] (0) 

[ADCY2, ADCY5, CACNAIC, (0) 
CREBl, PRKAR2B, IVL] 

[ADCY2, ADCY5, CACN Al C, (0) 
CREBl, PRKARIB, PRKAR2B, 
PCTKl] 

[ADCY2, ADCY5, Aia3, (0) 
PRKARIB, PRKAR2B] 

[ARHGEF2, EFNA5, EPHA4, (0) 
MCF2L] 

[ADCY5, CACNAIC, (0) 
PRKAR1B,PRKAR2B, PRKGl] 

[NGFR, PSEN2, RASGRFl] (0) 



[ADCY2, CREBl, HPCA, (<0.01) 
PRKARIB, PRKAR2B] 

[ADCY2, ADCY5, HPCA, (<0.01) 
HTRIA, PRKARIB, PRKAR2B] 

[ADCY2, ADCY5, CREBl, (<0.01) 
PRKARIB, PRKAR2B, RORA] 
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Table 3 Estimates and enriched categories of differentially^ regulated non-common genes (Continued) 



Down-regulated [18] 



Eutopic 



Up-reguloted [19] 



Down-regulated [56] 



Neurophysiological process involving 
dopamine D2 receptor signaling 

Neurophysiological process in 
circadian rhythm 

Alpha-2 adrenergic receptor regulation 
of ion channels 

NGF signaling pathway in 
apoptosis and survival 

Transcription involving CREB pathway 

Role of activin A in cell 
differentiation and proliferation 

GH-RH signaling 

ZNF202 in regulation of expression 
of genes involved in atherosclerosis 

Regulation of lipid metabolism by 
niacin and isoprenaline 

Ligand-independent activation of 
ESRl and ESR2 

Relaxin signaling pathway 



Melanocyte development 
and pigmentation 

Immune response involving antigen 
presentation by MHC class I 

Vitamin B6 metabolism 

Blood coagulation and platelet 
degranulation 

Cholesterol and sphingolipids 
intracellular transport 

GSL metabolism 

Cell cycle at initiation of mitosis and 
regulation of Gl/S transition 

Dopamine D2 receptor transactivation 
of PDGF receptor 

Apoptosis and survival involving 
caspase cascade, FAS signaling cascade 
and HTRIA signaling and 
anti-apoptosis by external signals via NF-kB 

G-protein signaling involving regulation 
CDC42 activity 

Gultamate regulation of Dopamine 
Dl A receptor signaling 

PKA signaling 

Translation involving regulation of EIF2 

DNA damage involving NHEJ 
mechanisms of DSBs repair 

Cytoskeleton remodeling involving activin A 

Olfactory transduction 

Cell cycle involving chromosome 
condensation in prometaphase, 
sister chromatic cohesion, regulation of S 
phase and initiation of mitosis 



[ADCY2, ADCY5, CACNAIC, (<0.01) 
PRKARIB, PRKAR2B] 

[ADCAYl, CLOCK, CREBl, (<0.01) 
CACNAIC, RORA] 

[ADCY5, A^3, CACNAIC, (<0.01) 
PRKARIB, PRKAR2B] 

[AHTS, CAD, GABl] (<0.01) 



(<0.01) 

[ADCY2, ADCY5, CREBl, (<0.01) 

NR5A1/SF1, PRKARIB, 

PRKAR2B] 

[ADCY2, ADCY5, CACNAIC, (<0.01) 
CREBl, PRKARIB, PRKAR2B] 

[ADRB3, AP0L2, LPL] (<0.01) 



[ADCY2, ADCY5, ADRB3, (<0.01) 
PRKARIB, PRKAR2B] 

[ADCY2, ADCY5, AI<T3, (<0.01) 
PRKARIB, PRKAR2B] 

[ADCY5,A^3, CREBl, (<0.01) 
PRKARIB, PRKAR2B] 

[Aia3, CREBl, PRKARIB, (<0.01) 
PRKAR2B, PRKGl] 

[PDIA3] (<0.01) 



[PHPTl] (<0.02) 

[F13A1] (<0.03) 

[PPIA] (<0.03) 

(<0.05) 

[LMNB2, PPP2R3A] (<0.01) 

[PPP2R3A] (<0.01) 

[LMNB2, PPP2R3A] (<0.01) 

[ARHGAP17] (<0.01) 

[PPP2R3A] (<0.01) 

[PPP2R3A] (<0.02) 

[PPPICC] (0) 

[CSNK2A2] (<0.02) 

[FNTA] (<0.02) 

[0R2H1] (<0.02) 

[HISTIHIC] (<0.03) 
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Table 3 Estimates and enriched categories of differentially^ regulated non-common genes (Continued) 







Cadherin mediated cell adhesion 


[PTPRF] 


(<0.03) 






GABA-A receptor mediated 
neurophysiological process 


rnnn i /■ — i 

[PPPICJ 


(<0.03) 






MAG-dependent inhibition of 
neurite outgrowth 


[MAG] 


(<0.05) 






Cell adhesion via PLAU signaling 


[CSNK2A2] 


(<0.05) 


Ectopic 


Up-regulated [0] 










Down-regulated [91] 


G-protein signaling involving Rap2A 
regulation pathway and G-protein 
alpha-s signaling cascade 


[PRKAR2B, RAPGEF3] 


(0) 






Glycolysis and gluconeogenesis 


[EN02] 


(0) 






cAMP-Ca+2-dependent 
signal transduction 


[PRKAR2B, RAPGEF3] 


(<0.01) 






G protein mediated regulation of 
MAPK-ERK signaling 


[PRKAR2B, RAPGEF3] 


(<0.01) 






Development involving MAG, 
PACAP signaling, activin A, A2A 
and A2B receptor signaling and 
Hedgehog signaling 


[MYH14, NANUG, N 1 hi, 
PRKAR2B, RAPGEF3] 


(<0.01) 






Regulation of eNOS activity 


[PRKAR2B] 


(<0.01) 






CCR3 signaling in eosinophils 


[FGR, MYH14] 


(<0.02) 






NMDA dependent neurophysiological 
process 


[PRKAR2B, RAPGEF3] 


(<0.03) 
(<0.03) 






CFTR expression, maturation and activity 


[HSPA6, PRKAR2B] 


(<0.03) 



3-fold at P< 0.01. 



were selected under BROAD cancer gene neighborhood 
sets, C4. Table 5 also shows the major gene families 
selected in GSEA and names of the genes showing dif- 
ferential display in the comparison between eutopic and 
ectopic endometrium. 

Expressional cohort of marker genes 

Table 6 provides the list of selected twenty eight (28) 
genes that appeared significant from combined analysis 
of GSEA-selected gene sets followed by DC analysis and 
from DE analysis of microarray data of 18 paired sam- 
ples. Table 6 also shows that the validity of the pre- 
diction value of the expressional cohort based on 
quantitative analysis in a different set of autologous, 
paired eutopic and ectopic samples obtained from a sep- 
arate group of 8 subjects was markedly high. 

Discussion 

The awareness that whole genome expression array ana- 
lysis may yield high dimension knowledge towards deci- 
phering patho-etiology of complex diseases [20,21] has 
prompted several groups of investigators to employ this 
approach to examine the transcriptomics basis of endo- 
metriosis using eutopic and ectopic samples [3-9]. Al- 
though significant and interesting observations have 
emerged from these reports, these studies did not 



include the possible impact of one or more of the factors 
like the demographic characteristics, position of endo- 
metriosis, fertility history, severity stages and phases of 
menstrual cycle influencing the genomic expression in 
eutopic and ectopic tissues [2,8,22]. In the present study, 
we have examined the whole genome transcriptomics of 
autologous, paired eutopic and ectopic samples obtained 
from fertile Indian women with ovarian endometriosis of 
known clinical severity and phases of menstrual cycle 
but with no history of previous treatment for endometri- 
osis at the time of tissue collection. We analyzed the ex- 
pression profiles to delineate the impact of stages of 
severity and phases of cycle in eutopic and ectopic sam- 
ples. We observed that clustering effect on expression 
arrays was maximum in paired samples, followed by 
stages of clinical severity and positional cue. The phase 
of menstrual cycle exhibited minimal clustering effect on 
expressional profiles in the experimental samples. 

Generally, we observed that eutopic tissue yielded a 
normal frequency distribution histogram of gene expres- 
sions for different levels of expression and that an over- 
all higher numbers of genes in eutopic endometrium 
expressed higher transcriptomic signals as compared to 
ectopic samples; ectopic samples yielded a truncated fre- 
quency distribution histogram. Furthermore, higher 
numbers of genes bearing expression levels at the high 
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Figure 4 K-means classification for aii the expressed probes across eighteen (18) autologous, paired eutopic and ectopic endometrial 
samples. The groups of genes showing similar expression pattern across the samples are clustered in four clusters (K1-K4) (A). Cluster Kl contains 
3938 probes (3210 genes), cluster K2 contains 14901 probes (8194 genes), cluster K3 contains 6982 probes (6078 genes) and cluster K4 contains 
9825 probes (8505 genes). The right panel (B) shows the average expression patterns of co-expressed probes each cluster across the annotated 
groups. 



and at the low ends of the frequency distribution were 
observed in ectopic tissues as compared to eutopic tis- 
sues. We believe that implementation of appropriate 
computational models based on Shannons noise-signal 
entities and probability of size of loss of signals may 
yield in the future new leads about the global genomic 
expression pattern in the ectopic tissue [23,24]. It is 
notable in this regard that: (i) a large number (-0.7 K) 
of genes were silenced in the ectopic tissue at stage 4 
condition as compared to stage 3, and (ii) expressional 
clustering cohesion was very high (cd: 0.07) between 
the eutopic and ectopic endometrium in stage 4 disease 
condition. Taken together, it is suggestive of high degree 
of pathognomonicity in stage 4 eutopic endometrium 
[8,25]. 

Major highlights in the previous studies on large scale 
expressional array analysis were to explore the gene- 
specific DE in paired analysis between eutopic and ec- 
topic endometrium with an assumption that a 2-fold 



change at P < 0.05 between two groups of tissue samples 
was sufficiently significant for further analysis. As 
pointed out elsewhere, this may give rise to different sets 
of biases and inadequacies in interpretation and discov- 
ery [26,27]. To circumvent these acknowledged insuffi- 
ciencies, we have employed a 3-fold change at P<0.01 
as the pre-set filter for DE of individual genes between 
groups followed by pathway networks based enrichment 
analysis, and for DC analysis of K-mean based expres- 
sional clusters followed by gene set enrichment analysis 
(GSEA) model [16] to interpret the present transcrip- 
tomics data. 

Post-hoc analysis of expressional signals in eutopic and 
ectopic tissues under different sets of categorical com- 
parison revealed that several signaling pathways related 
to immune response were commonly affected in eutopic 
and ectopic endometrium. The results from the present 
study support the observation made by Zhao et al. based 
on GSEA of archival transcriptomics data sets of 
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Table 4 Estimates and description of top scored enriched 
categories of differentially regulated (eutopic-to-ectopic) 
co-expressed (DC) genes 

Cluster identity (Total number of genes) [Number of genes with 
differential display: Up-regulated/Down-regulated] 

Enriched category (p-value) Gene symbols of major 

candidate genes 
(Vector of differential) 



Cluster ^, K1 (3210) [208: 120/88] 

Cell cycle (<0.01) 



IGF-1 receptor signalling (<0.01) 

Cytoskeleton remodelling 
involving regulation of actin (<0.02) 

0-glycan biosynthesis (<0.02) 

Immune response(<0.02) 

G-protein signalling in 

TC21 regulation pathway (<0.02) 

DNA damage (<0.03) 

Neurite outgrowth (<0.05) 

Regulation of glucose and 
lipid metabolism (<0.05) 

G-protein signalling in 

Rapl regulation pathway (<0.05) 

Progesterone action (<0.05) 

Signal transduction: 

Erk interactions (<0.04) 

A\<J signaling (<0.05) 

Cluster 2, K2 (8194) [148: 0/148] 

Immune response (0) 

Cell adhesion (0) 

Regulation of CFTR (0) 

Signal transduction involving 
activin A signalling regulation (0) 

Slit-Robo signaling (<0.01) 

Cytoskeletal remodelling (<0.01) 

Glutathione metabolism (<0.01) 

Chemotaxis (<0.02) 

Apoptosis and survival (<0.02) 

Glucose and lipid metabolism 
(<0.03) 

Neurite outgrowth (<0.03) 
N-glycan biosynthesis (<0.04) 



CCNB1,CDC45L, CENPF, 
NCAPD3, RAD51 (up); 

CCND2, MYL9, 0RC6L, 
TNP01 (down) 

IGF2 (up), IGFBP5, 
CCND2 (down) 

MYH9, MYL9 (down) 

GALNTl, GALNT7 (down) 
PGR (up), CCND2 (down) 
RRAS2 (up), RASGRP2 (down) 

NBN, RAD51 (up), CCND2 (down) 

MYH9, MYL9 (down) 

CYP51A1, FOXA (up), 
TNPOl (down) 

KRITl (up), RASGRP2 (down) 
CCNBl, PGR (up) 

DUSP4 (up) 

HSP90AA1 (up), CCDD2 (down) 

ACTG1,C3, C4B, CIQB, HLA-C, 
PTPN11/SHP2 (down) 

ACTGl, ITGBl, 
PTPN11/SHP2 (down) 

ACTGl, HSPA8, PSMBl, 
TSGlOl, UBC (down) 

BAMBI, H3F3A, UBC (down) 

ACTGl, R0B03 (down) 
ACTGl, DSTN, RALGDS (down) 
GSTM5, MGST3 (down) 
ACTGl, GNG4, ITGBl (down) 
HSPA8, ACTGl (down) 
APOE (down) 

ACTGl, DSTN (down) 
GALT, GANAB (down) 



Table 4 Estimates and description of top scored enriched 
categories of differentially regulated (eutopic-to-ectopic) 
co-expressed (DC) genes (Continued) 



Cluster 3, K3 (6078) [102: 102/0] 




Differentiation (0) 


PRKCG (up) 


Transcriptional regulation of 


MAX, PRKCG (up) 


amino acid metabolism (0) 




Androgen receptor nuclear 


NC0A2/GRIP1, WNT16(up) 


signaling (<0.01) 




Neurophysiological process 


GLRA3 (up) 


associated with PGE2-induced 




pain processing (<0.01) 





endometriosis that the main canonical pathways puta- 
tively involved in the process of endometriosis were 
related to that of immune and inflammatory diseases 
[27]. Additionally, we hypothesize from the results of the 
present study that functional connectivity between over- 
expression of CLOCK and inflammatory disorder, as well 
as, between over-expression of genes associated with lipid 
metabolism and inflammation at the local tissue level are 
operative in the pathogenesis of endometriosis [28-30]. 

We also observed that neuronal processes involving 
nNOS signaling pathways and GABA synthesis and 
metabolism were commonly expressed in both tissue 
types and a large number of genes involving several 
signaling pathways (corticoliberin, opioid receptors, 
serotonin receptors, melatonin, dopamine receptor, 
neuronal cell adhesion, NGF) associated with neuro- 
physiological processes were up-regulated in stage 3 
ectopic endometrium. Earlier the possible involvement 
of neuroendocrine processes in the pathogenesis of 
endometriosis has been implicated [31-34]. Pathways 
and networks based enrichment analysis of DE of in- 
dividual genes and DC of gene cohorts in four clus- 
ters in the present study revealed that expression of 
genes in pathways directly and indirectly associated 
with cell apoptosis and survival, cytoskeleton remod- 
eling, chemotaxis and cell adhesion were differentially 
affected in eutopic and ectopic samples. Involvement 
of these pathways in endometriosis has earlier been 
reported by several groups based on different experi- 
mental models [35-38]. 

The pathogenesis of endometriosis has also been 
associated with excessive production of estrogens by 
up-regulated expression of aromatase and 17|3-HSD 
type 1, and suppression of 17p-HSD types 2 and 4, col- 
lectively resulting in an increased ratio of estradiol- 1 7 13 
to estrone in ectopic tissues [39-42]. It has however 
been reported by others that mRNA and protein 
expressions of aromatase were minimal in ectopic tis- 
sues [42,43]. Our transcriptomics data also failed to 
identify any overt change in the expression of genes for 
aromatase (CYP19A1) and 17|3-HSD (HSD17B1-B17) in 
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Table 5 Summary of GSEA results for K-mean clusters 

BROAD gene sets (Set description) 

Cluster Number of selection Total number of genesis- 
identity (set identity; p value; number of genes in 

qFDR; NES) classified gene families1,2 



CI (Cytogenetics) 

Kl 0 
K2 0 
K3 0 



Table 5 Summary of GSEA results for K-mean clusters 

(Continued) 



K4 



1 (Chr4ql2;0; 
0.11; 1.8) 



47; 6 (Cgf); 1 (Ong); 2 (Tf) 



C2 (Functional) 

Kl 0 
K2 0 
K3 0 
K4 0 
C3 (Regulatory motif) 
Kl 



K2 



K3 
K4 



1 (RRCCG™_UN; 0; 
0.25; -1.8); 

1 (V$AP4_q6; 0; 0.05; 
-2.0) 



1 (GCAAGA,MIR-431;0; 
0.04; 1 .9) 



C4 (Cancer gene neighborhood) 

Kl 2 (MORF_MYC; 

(0; 0.00; 2.3) 

(M0RF_ESR1; 
0; 0.04; 2.0) 

K2 0 
K3 0 

K4 2 (MORF_RAD23B; 0; 

0.18; 1.8) 

(GNF2_CCNA2; 0.02; 
0.23; 1 .8) 

C5 (Gene ontology) 

Kl 0 
K2 0 
K3 0 



55; 1 (Cgf); 2 (Cdm); 
2 (Ong); 3 (Pk); 1 1 (Tf) 

173; 1 (Cdm); 9 (Cgf); 
7 (Hdp); 9 (Ong); 11 (Pk); 

34 af); 1 as) 



39; 1 (Cdm); 2 (Cgf); 
1 (Hdp); 2 (Ong); 1 (Pk); 
6 af); 2 (Ts) 



72; 1 (Cdm); 2 (Cgf); 
5 (Hdp); 5 (One); 4 (Pk); 

14 af) 

160; 3 (Cdm); 7 (Cgf); 
7 (Hdp); 8 (Ong); 9 (Pk); 
36 af); 3 as) 



164; 1 (Cgf); 2 (Ong); 
1 (Pk); 8 iJf): 2 as) 

62; 1 (Cdm); 6 (Pk); 
3 iJf); 1 (Ts) 



K4 


2 (PA^ERN_ BINDING; 


45; 2 (Cdm); 6 (Cgf); 1 (Jf) 




0; 0.07; 2.1) 






(POLYSACCHARIDE, 


36; 6 (Cgf); 1 (Jf) 




BINDING; 0; 0.18; 1.9) 





^Archived from GSEA data base [15]. 

^Based on common feature in homology and biochemical activities, not 
necessarily having common origin. FDR <0.25, minimally 10 genes in a set and 
maximum 1000 permutation. NES, Normalized enrichment score. The major 
gene families in selected gene sets in GSEA are: Cdm, cell differentiation 
markers (CD6, CDM, CD34, HMMR, NRCAM , PSG1, PVR, PVRL3, TLR2, and 
TNFSF4); Cgf, cytokines and growth factors (BMP10, CXCL5, FGF8, FGF10, 
FGF12, GMFB, HBEGF, HDGF, HTN3, IL13, IL16, INHA, LTBP4, MDK, MLN, NLN, 
NPFF, NPPA, NRG2, NRTN , PDGFRA, PF4, PF4V1, PGF, PPBP, PR0K2, STC2, 
TNFSF4, T0R2A, VEGFA, VGF, and XCL2); Hdp, homeodomain proteins (BARX2, 
H0XA6, H0XB5, H0XC4, H0XC12, H0XD4, LBX1, LHX8, MSX1, PAX7, PAX8, 
PITX3, P0U6F1, SIX3, and SIX5); Ong, oncogenes (ACT ARNT BCL2, BCL1 IB, 
BFA2T3, CNBP, ELF4, ERC1 , EWSR1, GAS7, LCK, MSI2, MYC, NCKIPSD, NR4A3, 
PAX7, PAX8, PDGFRA, PICALM, RUNX1, TPM3, and TRIM24); Pk, protein kinases 
(ADRBK1, AURKA, AURKB, BUB1B, CAMK2G, DAPK2 , EPHA2, ERN1, FLT1, GRK4, 
IKBKE, KSR2, LCK, MAP3K13, MAPK2K4, MAPK6, MELK, MYLK, NRK, OBSCN, 
PLK1, PRKACA, PRKCQ, PTK7, RPS6KB2, SNG1, STK16, STK17A, TRIM24, TTK, and 
WEED; Tf, transcription factors (AFF2, ARNT ATF7IP, ATRX, BACH1, BARX2, 
BCL1 IB, BRD1, CBFA2T3, CBX6, CLOCK , CNBP, C0PS5, CREB3L1, EGR3, ELF4, 
EP400, ESR1 , ESRRA, ETB3, EYA1, EYA3, EZH2, EZH2, F0SL1, F0SL2, F0XD3, 
F0XM1, GATA3, HDSE4, HMGB2, HMGN2, H0XA6, H0XB5, H0XC4, H0XC12, 
H0XD4, ID1, ID3, ILF3, IRF2, KLF13, LBX1, LHX8, MAZ, MBD4, MGA, MORF4L2, 
MSX1, MTA1, MYC , NEUR0G3, NF1, NFIA, NFIC, NFRKB, NFX1, NR1I2, NR3C2, 
NR4A3, PAX7, PAX8, PAX9, PAXIP1, PHTF1, PIAS2, PITX3, P0U6F1, PTF3L1, REST 
RUNX1, RXRG, SART3, SF1 (NR5A1) , SIX3, SIX3, SIX5, SMAD4, SMARCA2, 
SMARCA5, S0X12, SP3, TAF2, TBX15, TBX19, TBX5, TCF7, TFAP4, TFDP2, 
TRIM24, UBTF, VPS72, WT1, ZBTB22 , ZNF133, ZNF134, ZNF146, ZNF157, 
ZNF238, and ZNF592); Ts, tumor suppressors ( BRCA1 , BUB1B, GATA3, 
MAPK2K4, NF1, SDHC, SMAD4, WT1, and XPC). The genes that showed 
differential expression between eutopic and ectopic samples in the present 
study are marked by underlines. 



endometriosis. However, our observation that genes 
(NR5A1, STAR) for steroidogenic factor (SF)1 and ster- 
oidogenic acute regulatory protein (StAR), which are 
known to be significant regulators of steroidogenesis 
[44,45] were highly expressed in the ectopic endomet- 
rium substantiates previous reports [46-48]. 

An over-expression of ERBB3 in proliferative phase 
eutopic endometrium and secretory phase ectopic endo- 
metrium was seen in the present study; it has been asso- 
ciated with ligand-independent activation of estrogen 
receptors (ESRl and ESR2) in target tissues [49]. How- 
ever, ERBB3 was up-regulated in eutopic tissues as com- 
pared to ectopic tissues in pooled analysis. The 
activation of ESRs and relative down- regulation of pro- 
gesterone receptor (PGR) in the secretory phase ectopic 
endometrium is suggestive of relative suppression of 
progesterone action in the ectopic endometrium. Indeed, 
the phenomenon of progesterone resistance in the ec- 
topic endometrium has earlier been documented by 
other groups [8,30,50]. 

Collectively, it appears that the results of our study 
corroborate well with the previous reports and lead us 
to hypothesize that eutopic endometrium which is 
transcriptionally dysfunctional in mediating immune- 
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Table 6 Selected genes expression of which bear 
predictable leads to ovarian endometriosis among Indian 
women 



Expression characteristics 


Gene symbol (GenBank ID) 


Fold change^ 


[Gene name] 


Microarrayb RT-PCR*" 


Up-regulated in eutopic tissue 






General 






3HMGN2 (NM_005517) 


4.8+1 .0 


2.4++0.3 


[High-mobility group nucleosomol 
binding donna in 2] 






MKI67 (NM_002417) 


3.4+0.9 


4.6+1.8 


[Ki-67-lil<e antigen] 






NRCAM (NM_001037132) 


4.5+1 .6 


2.1+0.5 


[Neuronal cell adhesion nnolecule]* 






PARC (NM_003631) 


3.8+2.1 


3.4+1 .5 


[Poly (adp-ribose) glyco hydrolase] 






TMPO (NM_00 1032283) 


3.1+1.9 


2.8+1.1 


[Thymopoietin] 






Stage 3 






ATP2A2 (NM_001681) 


5.1+1.9 


8.6+3.5 


[ATPase, Ca++ transporting, cardiac 
m useless low twitch 2] 






CHIA (NM_021797) 


3.4+1.3 


7.3+1.5 


[Chitinase, acidic] 






DAPK2 (NM_014326) 


6.2+2.3 


9.7+1 .8 


[Death-associated protein kinase 2]* 






bHCl (Nl\/l_l 78040) 


6.8+1 .5 


10.8+3.0 


[Elks/rab6-interacting family member 1]* 






TACC2 (NM_206862) 


13.7+3.2 


6.4+2.0 


[Transforming, acidic coiled-coil 
containing protein 2] 






ZBTB22 (NM_005453) 


5.2+2.7 


12.3+3.1 


[Zinc finger and BTB domain 
containing 22]* 






Proliferative phase 






BAGS (NM_001015049) 


4.2+1 .9 


6.5+4.1 


[Bcl2-associated athanogene 5] 






CDCA3 (NM_031299) 


3.9+1 .5 


9.9+6.6 


[Cell division cycle associated 3] 






EGR3 (NM_004430) 


3.7+1 .0 


5.3+1.4 


[Early growth response 3]* 






FGFBPl (NM_005130) 


5.7+2.1 


7.3+2.7 


[Fibroblast growth factor binding protein 1] 






TPM3 (NM_001 043352) 


3.4+1 .9 


6.3+3.4 


[Tropomyosin 3] 







3.6+1.7 5.9+1.9 



Table 6 Selected genes expression of which bear 
predictable leads to ovarian endometriosis among Indian 
women (Continued) 

Secretory phase 

DHRS3 (NM_004753) 

[Dehydrogenase/reductase (SDR family) member] 
Up-regulated in ectopic tissue 

General 

BAPl (NM_004656) 

[BRCA 1 associated protein- 1 ]* 

CBLLl (NM_024814) 

[Cas-Br-M (murine) ecotropic retroviral 
transforming sequence-like 1 ] 

CLOCK (NM_004898) 

[Clock homolog (mouse)]* 

EPB41L1 (NM_012156) 

[Erythrocyte membrane protein 
band 4. 1-1 ike 1] 



12.6+3.3 13.9+3.4 



6.2+2.5 6.1+2.5 



9.0+2.6 5.3+0.6 



3.1+1.6 2.6+1.2 



LPL (NM_000237) 

[Lipoprotein lipase] 

PPAT (NM_002703) 

[Phosphoribosyl pyrophosphate 
amidotransferase] 

SCAMPI (NM_004866) 

[Secretory carrier membrane protein 1] 

SFRSl (NM_001078166) 

[Splicing factor, arginine/serine-rich 1 ] 

USP46 (NM_022832) 

[Ubiquitin specific peptidase 46] 

YWHAE (NM_006761) 

Uyrosine 3-monooxygenase/tryptophan 
5-monooxygenase activation protein, epsilon] 

ZNF644 (NM_201269) 

[Zinc finger protein 644] 



5.5+2.9 7.2+2.C 



5.8+2.9 5.6+3.4 



10.5+3.2 6.0+2.^ 



^.7+3.6 6.6+2.2 



4.0+1.8 3.7+1.1 



3.4+2.7 3.3+1.^ 



4.4+1.0 5.1+2.9 



^Between eutopic and ectopic samples, shown as mean±SD. 
"^Based on GSEA implementation on DC results and their DE analysis (n = 18). 
^Based on transcript numbers from RT-PCR analysis of RNA samples from a 
different set of autologous paired eutopic and ectopic tissues (n = 8). 
*Genes selected in gene families identified in GSEA gene sets (see Table 5). 



neuro-endocrine responses may bear vulnerability to 
give rise to endometriotic lesion if deposited ectopically. 
Subsequently, the ectopically placed endometrial tissue 
with positional input and under fluctuating levels of sex 
steroid hormones in vulnerable subjects may develop 
differential expression repertoire related to cell survival, 
adhesion, migration and growth resulting in endometrio- 
tic lesion [51,52]. More over, it appears that a pathways- 
network of several transcription factors including 
CLOCK-ESRl-MYC may be involved at the transcrip- 
tomic level towards pathoetiology of ovarian endometriosis 
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Membrane Cytoplasm 



Nucleus 




^ Regulators J Generic binding protein 

>^ Transcription factor >^ Generic enzyme 



Positive effect 
Negative effect 
Unspecified effect 



Figure 5 Knowledge-based construction of the pathways-network of transcription factors putatlvely associated with pathogenesis of 
endometriosis. The transcription factors were identified from GSEA implementation on co-expressed genes. It is notable that CLOCK, ESRl, and 
MYC (shown inside blue dotted rectangle) are differentially co-expressed in endometriosis as shown is Table 5. 



(Fig. 5). Further studies are warranted to test this 
hypothesis. 

Endometriosis, by definition, is a benign disease, how- 
ever, there are a few reports indicating risk of malignant 
transformation in endometriosis [53-55]. In the present 
study, we observed a general suppression in the expres- 
sion of genes associated with cell cycle and DNA dam- 
age repair in both eutopic and ectopic endometrium in 
fertile women with endometriosis. Interestingly, there is 
a recent report indicating a better survival rate for 
women with endometriosis for all malignancies com- 
bined, and specifically for ovarian and breast cancer, 
while it was poorer in malignant melanoma [56]. While 
the present results revealing the lack of overt oncogenic 



potential in endometriotic tissue concur with some of 
the earlier reports [6-8], genes (CHEKl, ERBB family, 
laminin gamma and Ki-67) associated with gynecological 
cancers [57-60] were highly expressed in autologous, 
paired eutopic and ectopic tissues. Thus, the possibility 
of inducement of oncogenic transformation through crit- 
ical phase transition [61] in the course of endometriosis 
disease progression cannot be ruled out, especially in the 
high risk population [62,63]. 

Finally, we have identified for the first time a cohort of 
twenty-eight (28) genes with high degree of predictability 
index for ovarian endometriosis in fertile women. We 
believe this cohort of genes can be used for further study 
to discover patho-physiology of ovarian endometriosis. 
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Conclusions 

Expressional profiles between paired eutopic and ectopic 
samples showed markedly higher cohesion compared 
with that of clinical stages of severity and phases of 
menstrual cycle. Endometriotic endometrium displayed 
anomalous expressional balance for several genes asso- 
ciated with immunological, neuracrine and endocrine 
functions. Although no overt oncogenic potential in 
endometriotic tissue was observed, expressions of a few 
genes (CHEKl, ERBB family, laminin gamma and Ki-67) 
associated with gynecological cancers were seen to be 
up-regulated. A novel cohort of twenty-eight (28) genes 
representing potential marker for ovarian endometriosis 
in fertile women was discovered. 
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